Inferring Hierarchical Clustering Structures by Deterministic Annealing
Abstract
The unsupervised detection of hierarchical structures is a major topic in unsupervised learning and one of the key questions in data analysis and representation. We propose a novel algorithm for the problem of learning decision trees for data clustering and related problems. In contrast to many other methods based on successive tree growing and pruning, we propose an objective function for tree evaluation and we derive a non-greedy technique for tree growing. Applying the principles of maximum entropy and minimum cross entropy, a deterministic annealing algorithm is derived in a mean-field approximation. This technique allows us to canonically superimpose tree structures and to fit parameters to averaged or 'fuzzified' trees.
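As a rough illustration of the deterministic annealing idea mentioned in the abstract, the sketch below applies it to plain (flat) clustering with a squared-distance cost. It is not the tree-structured algorithm of the paper: the function names, the cooling schedule, and all default parameters are assumptions chosen for illustration. At each temperature, points receive maximum-entropy (Gibbs) assignment probabilities, centers are re-estimated as weighted means, and the temperature is then lowered so the soft assignments gradually harden.

```python
import math

def soft_assignments(points, centers, temperature):
    """Maximum-entropy (Gibbs) assignment probabilities at a given temperature."""
    result = []
    for x in points:
        costs = [sum((a - b) ** 2 for a, b in zip(x, c)) for c in centers]
        lowest = min(costs)  # subtract the minimum for numerical stability
        weights = [math.exp(-(d - lowest) / temperature) for d in costs]
        total = sum(weights)
        result.append([w / total for w in weights])
    return result

def update_centers(points, probs, centers):
    """Re-estimate each center as the probability-weighted mean of the points."""
    dim = len(points[0])
    new_centers = []
    for k, old in enumerate(centers):
        mass = sum(p[k] for p in probs)
        if mass < 1e-12:
            new_centers.append(list(old))  # keep a center that attracts no mass
            continue
        new_centers.append([
            sum(p[k] * x[d] for p, x in zip(probs, points)) / mass
            for d in range(dim)
        ])
    return new_centers

def anneal(points, n_clusters=2, t_start=10.0, t_end=0.01,
           cooling=0.8, inner_steps=20):
    """Flat clustering by deterministic annealing (illustrative schedule)."""
    dim = len(points[0])
    mean = [sum(x[d] for x in points) / len(points) for d in range(dim)]
    # All centers start near the global mean; small deterministic offsets
    # break the symmetry so that centers can separate as the system cools.
    centers = [[m + 1e-3 * (k + 1) for m in mean] for k in range(n_clusters)]
    temperature = t_start
    while temperature > t_end:
        for _ in range(inner_steps):
            probs = soft_assignments(points, centers, temperature)
            centers = update_centers(points, probs, centers)
        temperature *= cooling
    return centers, soft_assignments(points, centers, temperature)

if __name__ == "__main__":
    data = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
            (5.0, 5.0), (5.0, 6.0), (6.0, 5.0)]
    final_centers, final_probs = anneal(data)
    print(final_centers)
```

On the two well-separated groups in the usage example, the centers should settle near the two group means. Starting all centers at the global mean, rather than at random data points, is a deliberate choice here: it makes the result depend on the cooling schedule instead of on the initialization.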
Similar resources
Constrained Clustering as an Optimization Method
Our deterministic annealing approach to clustering is derived on the basis of the principle of maximum entropy, is independent of the initial state, and produces natural hierarchical clustering solutions by going through a sequence of phase transitions. This approach is modified here for a larger class of optimization problems by adding constraints to the free energy. The concept of constraine...
Hierarchical Pairwise Data
Partitioning a data set and extracting hidden structure arise in different application areas of pattern recognition, data analysis and image processing. We formulate data clustering for data characterized by pairwise dissimilarity values as an assignment problem with an objective function to be minimized. An extension to tree-structured clustering is proposed which allows a hierarchical groupin...
Distributional Similarity, Phase Transitions and Hierarchical Clustering
We describe a method for automatically clustering words according to their distribution in particular syntactic contexts. Words are represented by the relative frequency distributions of contexts in which they appear, and relative entropy is used to measure the dissimilarity of those distributions. Clusters are represented by "typical" context distributions averaged from the given words accordi...
Improved clustering using deterministic annealing with a gradient descent technique
Various techniques exist to solve the non-convex optimization problem of clustering. Recent developments have employed a deterministic annealing approach to solving this problem. In this letter a new approximation clustering algorithm, incorporating a gradient descent technique with deterministic annealing, is described. Results are presented for this new method, and its performance is compared...
Publication date: 1999